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1.  INTRODUCTION 


During  the  past  decade,  the  minicomputer  Industry  has  experienced 
an  explosive  period  of  growth,  in  terms  of  technological  advances 
and  market  volume.  According  to  recent  Datapro  Research  Corporation 
Reports,  estimates  of  worldwide  minicomputer  market  volumes  are: 

1972  [1]  $300  - $A50  million 

1975  [21  $800  million  - $1.4  billion 

1977  [21  $1.8  billion. 

These  figures  are  rather  striking  by  themselves  even  if  we  do 
not  take  into  account  the  rapid  decrease  in  the  cost  of  central 
processors.  Kenney  [101  wrote,  "In  1966,  for  example,  the  processor 
cost  approximately  $30,000,  but  six  years  later,  1972,  its  price 
was  only  20  percent  of  that  cost,  about  $6,500."  Monrad-Krohn  [121 
(1977)  estimated,  "The  central  processing  element  of  a computer 
has  decreased  to  the  cost  of  about  $20." — of  course,  he  was 
referring  to  the  lower  spectrum  of  present  generation  of  micro 
computers. 

During  this  period  of  explosive  growth,  technological  advances 
in  the  hardware  components  have  far  exceeded  the  development  of  soft- 


ware. The  following  quotes  are  fairly  typical  of  current  opinions 
about  minicomputer  software: 


"The  present  state  of  software  development  is  far  from  being 
acceptable  . . . Development  of  the  software  takes  longer  than 
anticipated  and  almost  always  the  costs  are  more  than  expected,  r % 

At  times  the  finished  product  does  not  perform  as  expected,  THiHi 

and  there  have  been  times  when  it  didn't  perform  at  all."  lun  SsctlM  □ 

[10,  p.  761  |uNwnwuiica>  □ 
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"Software,  which  had  long  received  only  cursory  attention  form 
the  predominantly  hardware-oriented  minicomputer  makers,  is 
rapidly  becoming  the  principal  distinghlshing  factor  between 
competitive  product  lines."  [2,  p 70c*010-20d] 

Given  the  state  of  general  software  development  of  minicomputers, 
it  should  be  no  surprise  that  existing  statistical  software  for 
minicomputers  is  fragmented,  localized,  and  often  primitive. 

Some  manufacturers  (such  as  Hewlett-Packard)  serve  as  the  distributor  of 
user-contributed  software.  Including  statistical  programs  and  systems. 

In  such  cases,  the  lack  of  quality  control  standards  for  contributed 
programs  resulted  in  many  library  programs  that  are  low  In  quality,  by  any 
reasonable  standards  of  evaluation.  Portable  statistical  systems  for 
minicomputers,  interactive  or  not,  are  almost  nonexistent.  MlnlBMD 
[5]  is  perhaps  the  first  serious  attempt  at  the  creation  of  a portable, 
high  quality,  general  purpose  statistical  system  specifically  designed 
for  minicomputers. 

For  the  aforementioned  reasons.  Instead  of  doing  a survey  of 
existing,  non-portable,  statistical  software,  I shall  consider  some 
characteristics  of  portable  statistical  software  for  minicomputers 
in  the  immediate  future  by  focussing  on  constraints  imposed  by  such 
computers  on  the  design  and  implementation  of  Interactive  statistical 
systems.  In  my  opinion,  interactive  systems  are  of  paramount  importance 
in  the  effective  use  of  statistics  on  minicomputers,  and  the  effective 
design  of  such  systems  must  pay  close  attention  to  the  constraints. 
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2.  WHAT  IS  A MINICOMPUTER? 

One  agreement  within  the  minicomputer  Industry  is  that  there 
Is  disagreement  as  to  what  constitutes  a minicomputer.  For  the 
purpose  of  the  present  discussion,  I shall  use  the  pseudo-definition 
"minicomputers  are  machines  whose  mainframes  sell  for  less  than 
$50,000  (or  some  other  arbitrary  figure)"  In  the  spirit  minicomputers 
are  defined  In  [2].  A typical  system  configuration  costs  two  to 
four  times  the  cost  of  the  mainframe.  There  are  no  clear  cutoff 
values  that  separate  minis  from  micros  and  midis  (see  e.g.  [12,  15]). 
For  example,  Interdata  8/32  Is  classified  as  a mini  In  [2]  and  a mldl 
In  [15].  Given  the  trend  of  Increasing  computer  power  and  decreasing 
cost,  the  next  generation  of  minis  will  likely  be  comparable  to 
some  of  today's  maxis  in  capacity  and  performance. 

The  most  Important  distinguishing  characteristic  of  a mini  Is 
its  word  length.  A "typical"  mini  currently  on  the  market  has  a 
16-blt  word  length,  although  minis  with  word  lengths  of  as  many 
as  32  bits  or  as  low  as  8 bits  are  not  rare.  For  a minicomputer 
which  Is  capable  of  supporting  a moderately  versatile  Interactive 
statistical  system,  we  may  consider  the  following  to  be  some  of 
its  "typical"  characteristics: 

Software  support:  a time  sharing  operating  system. 

BASIC  and/or  FORTRAN  compilers. 

Main  Format:  16-blt  word  length  (and  up). 

Main  storage:  magnetic  core  having  a maximum  storage 
capacity  of  32768  words  (and  up). 

I/O  control:  DMA  channel  and  multilevels  of  external 
Interrupt. 

Peripheral:  disk  pack  or  cartridge  drives,  tape  delves  and 
other  standard  I/O  devices. 
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3.  CHOICE  OF  COMPUTER  AND  INTERACTIVE  SYSTEM  DESIGN  — 

WHICH  COMES  FIRST  OR  SHOULD  IT  MATTER? 

From  the  system  designer's  point  of  view,  two  general  optlmltstlon 
approaches  are  possible: 

(A)  Consider  an  Ideal  design  of  an  Interactive  system  and  then 
choose  a computer  whose  characteristics  are  most  suitable  for  the 
Implementation  of  that  design. 

(B)  Given  a computer  and  Its  associated  software,  design  an 
Interactive  system  which  attempts  to  make  optimal  use  of  the  available 
features  and  resources. 

In  practive,  approach  (A)  is  generally  not  available  to  the 
statistical  system  designer;  and  Judging  from  the  characteristics 
of  existing  Interactive  statistical  software  for  large  and  small 
computers,  approach  (B)  appears  to  be  the  norm.  As  a result,  most 
of  them  (e.g.,  IDA  [ll] , isp  [41,  MIDAS  (6,  7],  SAS  [13],  SIPS  [9], 
and  SPEAKEASY  [14])  achieve  certain  desirable  features  or  local 
optimality  at  the  expense  of  severely  limited  portability. 

If  we  use  the  criteria  for  evaluating  statistical  software  In 
[8,  16]  as  guidelines  for  designing  an  Interactive  system,  then  neither 
approach  (A)  nor  approach  (B)  would  be  appropriate.  Instead,  the 
system  designer  should  first  consider  the  constraints  imposed  by 
the  requirement  of  portability  to  choose  the  software  language  used 
to  code  the  Interactive  system  (e.g.,  at  the  present  time,  neither 
APL  nor  PL/I  would  be  an  appropriate  choice  because  most  minicomputers 
do  not  have  an  Interpreter  or  compiler  for  these  languages,  although 
purely  from  a programming  language  point  of  view,  they  are  In  many 
respects  better  than  their  counterparts  BASIC  and  FORTRAN  which  are 
widely  supported.) 
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Our  experience  with  existing  interactive  systems  should  have 
taught  us  a lesson  about  the  importance  of  portability.  Far  too 
often,  system  designers  (myself  included)  exhibit  systems  with  many 
desirable  features  but  unfortunately  have  to  inform  those  who  are 
interested  in  using  the  system  that  it  cannot  run  under  machine 
ABC  or  operating  system  XYZ  without  substantial  conversion  efforts. 

In  order  to  consider  a truly  portable  system,  we  are  not  only  con- 
strained to  use  BASIC  or  FORTRAN,  but  we  must  sacrifice  certain 
features  of  a system  if  their  implementation  would  require  non-standard 
features  of  those  languages.  Similarly,  other  constraints  imposed 
by  minicomputers  should  be  carefully  considered  before  a system 
is  designed  or  implemented. 
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4. 


CONSTRAINTS  IMPOSED  BY  MINICOMPUTERS 


The  major  categories  of  evaluation  criteria  and  their  dependence 
on  the  characteristics  of  a ''typical*'  minicomputer  can  be  summarized 

i 

i by  Figure  I.  The  diagram  suggests  that  the  partition  size  (which  is 

generally  a function  of  the  primary  core  size)  plays  an  important  role 

I 

I in  all  aspects  of  a statistical  system  design. 

' Figure  2 gives  a schematic  representation  of  some  typical  imple- 

mentations (using  BA<;iC  or  FORTRAN  as  the  source  language)  that 
further  restricts  the  space  available  for  active  data  and  system 
parameters.  In  general,  the  use  of  FORTRAN  places  much  greater 
constraint  on  the  total  size  (and  hence  extensibility)  of  a system  while 
the  most  favorable  language  for  modularizing  a large  system  (BASIC  with 
CHAIN  and  COMMON)  is  likely  to  have  severe  portability  problems.  The 
constraints  that  effect  each  of  several  majot  evaluation  items  will 
be  elaborated  below: 


A.  User  Interface 

A. 1 Date  structure  and  size  of  active  data 

The  most  distinguishing  feature  between  a statistical  system 
on  a minicomputer  and  one  on  a maxicomputer  is  probably  the  total 
size  of  the  "active"  arrays  (variables  addressable  in  the  primary 
core).  For  a system  running  on  a maxicomputer  with  a 256K  partition 
size,  say,  the  space  allocatable  to  active  arrays  generally  exceeds 
the  space  on  a mini  allocatable  to  the  entire  system.  Thus,  in 
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Figure  1 
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Figure  2 
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structured  to  Interface  efficiently  with  data  stored  In  secondary 
memory  locations  and  devices,  whereas  a maxi  system  may  have  sufficient 
space  to  place  the  entire  dataset  in  core.  Moreover,  a BASIC  system 
without  the  COMMON  feature  will  require  explicit  I/O  to  pass  variables 
and  system  parameters  among  modules  or  subprograms,  thereby  exacting 
a heavy  overhead  on  the  performance  of  the  system. 

A. 2 Command  language  structure 


All  interactive  systems  must  have  a command  language  structure. 
The  syntax  of  the  structure  may  range  from  simply  a dictionary  of 
COMMAND  WORDS  to  one  admitting  flexible  combinations  of  language 
phrases  and  arithmetic  expressions.  The  latter  will  require  a parsing 
algorithm  to  interpret  the  command  or  control  phrases.  The  partition 
size  of  a minicomputer  will  greatly  curtail  the  space  allocatable  to 
the  algorithm  and  thus  will  limit  its  complexity  and  generality. 

A. 3 Level  of  interaction  between  User  and  System 


The  minicomputer  itself  has  relatively  small  effect  on  this 
aspect  of  the  software  design.  The  source  language  used  and  the 
mode  of  communication  between  the  main  (driver)  program  and  subroutines 
(modules)  and  among  modules  will  determine  the  efficiency  of  the  inter- 
action (provided  the  system  is  optimally  designed  and  coded  for  man-machine 
interaction).  For  example,  of  the  two  types  of  BASIC  illustrated  in 
Figure  2,  the  one  with  CHAIN  and  COMMON  is  much  more  amenable  to  a 
flexible  structure  for  user-system  interaction  than  its  counterpart. 


\ 
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the  standard  BASIC  . 
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A.  4 Internal  documentation 

Ideally,  the  user  of  an  Interactive  system  ought  to  be  able  to 
access  all  relevant  Information  and  documentation  about  the  system 
on  line,  without  the  necessity  of  a User's  Manual  or  various  reference 
manuals.  In  practice,  no  existing  system  accomplishes  this  Ideal, 
though  some  (such  as  SPEAKEASY,  with  several  hundred  pages  of  text  In 
the  HELP  file,  hierarchically  organized  In  a tree  structure)  come 
much  closer  to  an  Internally-documented  system  than  others.  For  mini- 
computers, even  considerable  less  text  than  that  In  the  SPEAKEASY 
system  would  be  constrained  by  the  limited  partition  size.  Thus,  only 
the  most  frequently  accessed  documentation  can  be  kept  In  core  while 
the  others  must  be  retrieved  from  secondary  or  peripheral  storage 
devices. 

Statistical  Effectiveness 

B.  1 Statistical  versatility 

The  statistical  versatility  of  a system  Is  constrained  primarily 
by  the  partition  space  utilization  as  Illustrated  In  Figure  2,  so  that 
the  constraint  Is  much  more  severe  for  a FORTRAN  system  than  one  In 
BASIC. 

A comment  is  perhaps  necessary  here  to  clarify  the  assertion  that 
a system  written  in  FORTRAN  has  greater  constraints  on  added  statistical 
capabilities  than  one  written  in  BASIC.  In  a FORTRAN  environment, 
statistical  as  well  as  I/O  tasks  that  are  common  to  many  procedures 
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(modules)  are  accomplished  by  a CALL  SUBROUTINE  statement  within  the 
module  with  the  subroutines  being  called  resident  in  core  at  all 
times.  Thus,  as  a system  grows,  there  will  be  more  and  more  of  such 
"utility"  subroutines.  In  a BASIC  environment,  the  implicit  subroutine 
call  feature  does  not  exist,  so  that  often  the  identical  codes  (or 
codes  with  different  names)  are  explicitly  coded  within  each  and  every 
subprogram  or  module  of  the  system,  as  a matter  of  necessity  imposed 
by  the  language.  In  theory,  if  we  simulate  this  form  of  inefficiency 
in  FORTRAN  (by  discarding  the  effective  use  of  subroutines)  then  the 
overlay  structure  in  FORTRAN  is  no  different  from  the  chaining  structure 
in  BASIC  insofar  the  programmer  is  concerned.  However,  it  appears 
reasonable  to  assume  that  when  one  is  working  within  a portable 
FORTRAN  environment  (having  sacrificed  many  non-standard  but  more 
powerful  features)  one  is  entitled  to,  and  should,  make  effective 
use  of  the  SUBROUTINE  features  in  FORTRAN  while  paying  a price  in  the 
extensibility  of  a large  system. 

B. 2 Numerical  accuracy 

The  primary  constraint  is  the  word  length  of  a minicomputer  which 
limits  the  achievable  numerical  accuracy  of  the  minicomputers. 

Typically,  minicomputers  do  not  have  the  option  to  perform  computations 
in  double-precision  arithmetic  while  many  statistical  computational 
algorithms  require  double-precision  to  ensure  a high  degree  of  accuracy. 
A secondary  constraint  may  be  considered  to  be  the  CPU  speed  of 
arithmetic  operations  because  algorithms  capable  of  achieving  a high 
degree  of  numerical  accuracy  at  the  expense  of  "number  crunching" 
may  have  to  be  discarded  in  favor  of  less  accurate,  but  much  speedier 


algorithms. 
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C.  Impl ement  at Ion 

C. 1 Extensibility 

The  Implementation  of  a system  should  make  allowances  for  two  types 
of  modification  or  extension: 

(1)  Added  system  capabilities  (new  commands  or  procedures). 

(2)  Accommodations  of  user-supplied  procedures  or  routines. 

The  feasibility  and  ease  of  implementing  these  depend  heavily  on 
the  software  language  used  to  code  the  system  and  to  some  extent 
on  the  operating  system  on  which  the  package  is  run.  Typically 
such  extensions  are  much  more  easily  accomplished  in  BASIC  (or  any 
interpretive  language)  than  in  FORTRAN  (which  requires  compilation  ^ 
linking,  and  the  creation  of  a new  load  module  for  the  entire 
system  before  execution  of  the  new  procedure  can  take  place).  At 
the  present  state  of  affairs,  I would  assess  the  extensibility  of 
a FORTRAN  system  to  be  moderately  clumsy  to  fair  for  the  system 
implementor,  and  difficult  to  impossible  for  the  user.  On  the  other 
hand,  extending  a system  written  in  BASIC  is  generally  simple  and 
straight-forward . 

C. 2 Portability 

Among  all  of  the  evaluation  criteria  of  a statistical  system, 
portability  is  probably  the  most  challenging  one  to  satisfy  as 
well  as  one  which  is  much  more  restrictive  than  it  may  seem.  The 
major  constraint  lies  in  the  fact  that  even  for  commonly  used 
languages  such  as  BASIC  and  FORTRAN,  different  manufacturers  of 
minicomputers  support  different  features  of  the  languages  ( as  well 
as  compiler  and  operating  system  for  each  of  the  languages).  Con- 
sequently, to  achieve  portability,  often  certain  desirable  features 
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have  to  be  sacrificed  (e.g.,  efficient  coding,  efficient  I/O,  and 
optimal  Interrupt  handling  and  error  recovery)  In  order  that  the 
system  can  be  run  without  modification  on  different  computers. 


5.  LOOKING  AHEAD  TOWARDS  THE  NEXT  GENERATION 


In  this  paper,  I presented  my  impression  of  the  constraints 
Imposed  by  the  present  generation  of  minicomputers  on  the  design  and 
Implementation  of  Interactive  statistical  systems.  Given  the  present 
rate  of  technological  advances  and  decrease  in  the  cost  of  the  hard- 
ware, It  appears  likely  that  the  next  generation  of  minicomputers  will 
approach  or  surpass  most  of  the  present  generation  maxicomputers  In 
capacity  and  performance.  As  a result,  many  of  the  existing  constraints 
will  be  partially  or  totally  removed  simply  as  a natural  consequence 
of  progress.  However,  constraints  In  the  portability  of  software  will 
likely  remain  In  the  near  future;  and  may  be  better  or  worse  in  the 
Intermediate  future,  depending  on  the  demands  of  the  "buyers"  and  the 
manufacturers'  assessments  of  the  needs  of  the  existing  and  potential 
market.  In  either  case,  the  scientific  computing  community  In  general 
and  the  statistical  computing  community  In  particular  (both  being 
small  minorities  In  the  computing  market  of  consumers)  will  be 
unlikely  to  have  any  major  Impact  on  the  manufacturers'  hardware  and 
software  designs.  Thus,  even  if  it  becomes  technologically  feasible 
to  eliminate  all  of  the  constraints  discussed  In  the  paper  for  mini- 
computers, some  of  them  will  remain  because  of  the  diversity  of  demands 
of  different  groups  of  users. 
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